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LISTING OF CLAIMS 

This listing of claims will replace all prior versions and listings of claims in the 
Application. 

1 . (Currently Amended) A method for processing data in a distributed architecture, the 
method comprising the steps of: 

receiving a work request that identifies at least one repository for processing, wherein the 
at least one identified repository is included in a plurality of repositories; 

determining a repository type of the at least one repository; 

determining a spider type for gathering information content from the at least one 
identified repository, wherein the spider type is determined based on the repository type; 

gathering information content from the at least one id e ntifi e d repository identified in 
accordanc e with the work request; 

registering the information content; 

assigning the information content to at least one document identifier; 
transmitting,the work request regarding at l e ast a portion of t he information content to a 
first work queue; 

processing the work request by generating a meta-document representation of the at least 
abortion of the information content; 

transmitting the meta-document representation to a second work queue; and 
analyzing the meta-document representation. 

2. (Cancelled) 

3. (Previously Presented) The method of claim 1, wherein the meta-document 
representation comprises extensible markup language (XML) format. 

4. (Previously Presented) The method of claim 1, wherein the step of analyzing the 
meta-document representation comprises metrics extraction of the meta-document 
representation. 
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5. (Previously Presented) The method of claim 1, wherein the step of analyzing the 
meta-document representation comprises: 

indexing the meta-document representation. 

6. (Previously Presented) The method of claim 1, further comprising the step of: 
generating progress statistics regarding the step of analyzing the meta-document representation. 

7. {Original) The method of claim 6, further comprising the step of: 
transmitting the progress statistics to a third work queue. 

8. (Original) The method of claim 1, wherein the first work queue and the second work 
queue share access to a central data structure. 

9. (Original) The method of claim 8, wherein access is shared via a CORBA service. 

10. (Previously Presented) The method of claim 8, wherein the central data structure 
represents at least one of a metrics history and taxonomy regarding the information content. 

1 1 . (Currently Amended) A system for processing data in a distributed architecture, the 
system comprising: 

a scheduling module that identifies at least one repository for processing, determines a 
repository type of the at least one repository, determines a spider type for gathering information 
content from the at least one identified repository, and generates a work request that identifies 
the at least one repository for processing, 

wherein the spider type is determined based on the repository type, and the repository is 
included in a plurality of repositories; 

an information content gathering module that gathers information content from the 
id e ntifi e d repository identified in accordanc e with the work request; 

a registering module that registers the information content; 

an assigning module that assigns the information content at least one document identifier; 
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a work request transmitting module that transmits the work request regarding at l e ast a 
portion of the information content to a first work queue; 

a work request processing module that processes the work request by generating a meta- 
document representation of the at least a p ortion of the information content; 

an information content transmitting module that transmits the meta-document 
representation to a second work queue; and 

an information content processing module that analyzes the meta-document 
representation. 

12. (Cancelled) 

13. (Previously Presented) The system of claim 11, wherein the meta-document 
representation comprises extensible markup language (XML) format. 

14. (Previously Presented) The system of claim 11, wherein the information content 
processing module that analyzes the meta-document representation comprises: 

a metrics extraction module that performs metrics extraction on the meta-document 
representation. 

15. (Previously Presented) The system of claim 11, wherein the information content 
processing module that analyzes the meta-document representation comprises: 

an indexing module that indexes the meta-document representation. 

16. (Previously Presented) The system of claim 11, further comprising: 

a generating module that generates progress statistics regarding the analyzing of the 

* 

meta-document representation. 

17. (Original) The system of claim 16, further comprising: 

a progress statistics transmitting module that transmits the progress statistics to a third 
work queue. 
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18. (Original) The system of claim 11, wherein the first work queue and the second 
work queue share access to a central data structure. 

19. (Original) The system of claim 18, wherein access is shared via a CORBA service. 

20. (Previously Presented) The system of claim 18, wherein the central data structure 
represents at least one of a metrics history and taxonomy regarding the information content. 

21. (Currently Amended) A system for processing data in a distributed architecture, the 
system comprising: 

scheduling means for identifying at least one repository for processing, determining a 
repository type of the at least one repository, determining a spider type for gathering information 
content from the at least one identified repository, and generating a work request that identifies 
the at least one repository for processing, 

wherein the spider type is determined based on the repository type, and the repository is 
included in a plurality of repositories; 

gathering means for gathering information content from the id e ntified repository 
identified in accordanc e with the work request; 

registering means for registering the information content; 

assigning means for assigning the information content at least one document identifier; 

work request transmitting means for transmitting the work request regarding at l e ast a 
portion of the information content to a first work queue; 

work request processing means for processing the work request by generating a meta- 
document representation of the at least a p ortion of the information content; 

information content transmitting means for transmitting the meta-document 
representation to a second work queue; and 

information content processing means for analyzing the meta-document representation. 

22. (Cancelled) 
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23. (Previously Presented) The system of claim 21, wherein the meta-document 
representation comprises extensible markup language (XML) format. 

24. (Previously Presented) The system of claim 21, wherein the information content 
processing means for analyzing the meta-document representation comprises: 

metrics extraction means for performing metrics extraction on the meta-document 
representation. 

25. (Previously Presented) The system of claim 21, wherein the information content 
processing means for analyzing the meta-document representation comprises: 

indexing means for indexing the meta-document representation. 

26. (Previously Presented) The system of claim 21, further comprising: 

progress statistics generating means for generating progress statistics regarding the analyzing of 
the meta-document representation. 

27. (Original) The system of claim 26, further comprising: 

progress statistics transmitting means for transmitting the progress statistics to a third 
work queue. 

28. (Original) The system of claim 21, wherein the first work queue and the second 
work queue share access to a central data structure. 

29. (Original) The system of claim 28, wherein access is shared via a CORBA service. 

30. (Previously Presented) The system of claim 28, wherein the central data structure 
represents at least one of a metrics history and taxonomy regarding the information content. 

3 1 . (Currently Amended) A processor readable medium comprising processor readable 
code embodied therein for causing a processor to process data in a distributed architecture, the 
medium comprising: 
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work request receiving code that causes a processor to receive a work request that 
identifies at least one repository for processing, wherein the at least one identified repository is 
included in a plurality of repositories; 

repository type determining code that causes a processor to determine a repository type of 
the at least one repository; 

spider type determining code that causes a processor to determine a spider type for 
gathering information content from the at least one identified repository, wherein the spider type 
is determined based on the repository type; 

information content gathering code that causes a processor to gather information content 
from the id e ntified repositor y identified in accordanc e with the work request; 

registering code that causes a processor to register the information content; 

assigning code that causes a processor to assign the information content at least one 
document identifier; 

work request transmitting code that causes a processor to transmit the work request 
regarding at least a portion of the information content to a first work queue; 

work request processing code that causes a processor to process the work request by 
generating a meta-document representation of the at least a p ortion of the information content; 
information content transmitting code that causes a processor to transmit the meta-document 
representation to a second work queue; and 

information content processing code that causes a processor to analyze the meta- 
document representation. 

32. (Cancelled) 

33. (Previously Presented) The medium of claim 3 1 , wherein the meta-document 
representation comprises extensible markup language (XML) format. 

34. (Previously Presented) The medium of claim 31, wherein the information content 
processing code that causes a processor to analyze the meta-document representation comprises: 

categorizing code that causes a processor to categorize the meta-document representation. 
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35. (Previously Presented) The medium of claim 31, wherein the information content 
processing code that causes a processor to analyze the meta-document representation comprises: 

indexing code that causes a processor to index the meta-document representation. 

36. (Previously Presented) The medium of claim 31, further comprising: 

generating code that causes a processor to generate progress statistics regarding the analyzing of 
the meta-document representation. 

37. (Original) The medium of claim 36, further comprising: 

progress statistics transmitting code that causes a processor to transmit the progress 
statistics to a third work queue. 

38. (Original) The medium of claim 31, wherein the first work queue and the second 
work queue share access to a central data structure. 

39. (Original) The medium of claim 38, wherein access is shared via a CORBA service. 

40. (Previously Presented) The medium of claim 38, wherein the central data structure 
represents at least one of a metrics history and taxonomy regarding the information content. 

41 . (Previously Presented) The method of claim 1 wherein the step of analyzing the 
meta-document representation comprises categorizing the meta-document representation. 

42. (Previously Presented) The system of claim 1 1 , wherein the information content 
processing module that analyzes the meta-document representation comprises a categorizing 
module that categorizes the meta-document representation. 

43. (Previously Presented) The system of claim 21, wherein the information content 
processing means for analyzing the meta-document representation comprises categorization 
means for categorizing the meta-document representation. 
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44. (Previously Presented) The medium of claim 31, wherein the information content 
processing code that causes a processor to analyze the meta-document representation comprises 
categorization code that categorizes the meta-document representation. 
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